binding free energy
Supplementary Materials A Protein Targets Chosen for Generation
Figure A.1 shows the amino acid sequences corresponding to the three SARS-CoV -2 targets. We used a bidirectional Gated Recurrent Unit (GRU) with a linear output layer as an encoder. Figure B.1: The novelty of the scaffold of each generated molecule compared to the most similar scaffold in the training set. Similarity of the fingerprints, is shown next to the scaffold of each generated molecule. We show a representative set of molecules generated for each target in Figure D.1 Figure D.1: Representative molecules generated for (top to bottom): NSP9 Replicase, Receptor-Binding Domain (RBD) of S protein, and Main Protease of SARS-CoV -2 RBD has maximum subgraph similarity to a commercially available drug Telavancin (See Figure E.3).
- Health & Medicine > Therapeutic Area > Pulmonary/Respiratory Diseases (1.00)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
- Government > Regional Government > North America Government > United States Government > FDA (0.47)
Energy-Based Models for Predicting Mutational Effects on Proteins
Soga, Patrick, Lei, Zhenyu, He, Yinhan, Bilodeau, Camille, Li, Jundong
Predicting changes in binding free energy ($ΔΔG$) is a vital task in protein engineering and protein-protein interaction (PPI) engineering for drug discovery. Previous works have observed a high correlation between $ΔΔG$ and entropy, using probabilities of biologically important objects such as side chain angles and residue identities to estimate $ΔΔG$. However, estimating the full conformational distribution of a protein complex is generally considered intractable. In this work, we propose a new approach to $ΔΔG$ prediction that avoids this issue by instead leveraging energy-based models for estimating the probability of a complex's conformation. Specifically, we novelly decompose $ΔΔG$ into a sequence-based component estimated by an inverse folding model and a structure-based component estimated by an energy model. This decomposition is made tractable by assuming equilibrium between the bound and unbound states, allowing us to simplify the estimation of degeneracies associated with each state. Unlike previous deep learning-based methods, our method incorporates an energy-based physical inductive bias by connecting the often-used sequence log-odds ratio-based approach to $ΔΔG$ prediction with a new $ΔΔE$ term grounded in statistical mechanics. We demonstrate superiority over existing state-of-the-art structure and sequence-based deep learning methods in $ΔΔG$ prediction and antibody optimization against SARS-CoV-2.
- North America > United States > Virginia > Albemarle County > Charlottesville (0.14)
- North America > Canada > Ontario > Toronto (0.05)
- South America > Brazil (0.04)
- (3 more...)
HelixDesign-Antibody: A Scalable Production-Grade Platform for Antibody Design Built on HelixFold3
Gao, Jie, Hu, Jing, Zhang, Shanzhuo, Zhu, Kunrui, Qian, Sheng, Huang, Yueyang, Zhang, Xiaonan, Fang, Xiaomin
Antibody engineering is essential for developing therapeutics and advancing biomedical research. Traditional discovery methods often rely on time-consuming and resource-intensive experimental screening. To enhance and streamline this process, we introduce a production-grade, high-throughput platform built on HelixFold3, HelixDesign-Antibody, which utilizes the high-accuracy structure prediction model, HelixFold3. The platform facilitates the large-scale generation of antibody candidate sequences and evaluates their interaction with antigens. Integrated high-performance computing (HPC) support enables high-throughput screening, addressing challenges such as fragmented toolchains and high computational demands. Validation on multiple antigens showcases the platform's ability to generate diverse and high-quality antibodies, confirming a scaling law where exploring larger sequence spaces increases the likelihood of identifying optimal binders. This platform provides a seamless, accessible solution for large-scale antibody design and is available via the antibody design page of PaddleHelix platform.
HelixDesign-Binder: A Scalable Production-Grade Platform for Binder Design Built on HelixFold3
Gao, Jie, Li, Jun, Hu, Jing, Zhang, Shanzhuo, Zhu, Kunrui, Huang, Yueyang, Zhang, Xiaonan, Fang, Xiaomin
Protein binder design is central to therapeutics, diagnostics, and synthetic biology, yet practical deployment remains challenging due to fragmented workflows, high computational costs, and complex tool integration. We present HelixDesign-Binder, a production-grade, high-throughput platform built on HelixFold3 that automates the full binder design pipeline, from backbone generation and sequence design to structural evaluation and multi-dimensional scoring. By unifying these stages into a scalable and user-friendly system, HelixDesign-Binder enables efficient exploration of binder candidates with favorable structural, energetic, and physicochemical properties. The platform leverages Baidu Cloud's high-performance infrastructure to support large-scale design and incorporates advanced scoring metrics, including ipTM, predicted binding free energy, and interface hydrophobicity. Benchmarking across six protein targets demonstrates that HelixDesign-Binder reliably produces diverse and high-quality binders, some of which match or exceed validated designs in predicted binding affinity. HelixDesign-Binder is accessible via an interactive web interface in PaddleHelix platform, supporting both academic research and industrial applications in antibody and protein binder development.
Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions
Jiao, Xiaoran, Mao, Weian, Jin, Wengong, Yang, Peiyuan, Chen, Hao, Shen, Chunhua
Predicting the change in binding free energy ($\Delta \Delta G$) is crucial for understanding and modulating protein-protein interactions, which are critical in drug design. Due to the scarcity of experimental $\Delta \Delta G$ data, existing methods focus on pre-training, while neglecting the importance of alignment. In this work, we propose the Boltzmann Alignment technique to transfer knowledge from pre-trained inverse folding models to $\Delta \Delta G$ prediction. We begin by analyzing the thermodynamic definition of $\Delta \Delta G$ and introducing the Boltzmann distribution to connect energy with protein conformational distribution. However, the protein conformational distribution is intractable; therefore, we employ Bayes' theorem to circumvent direct estimation and instead utilize the log-likelihood provided by protein inverse folding models for $\Delta \Delta G$ estimation. Compared to previous inverse folding-based methods, our method explicitly accounts for the unbound state of protein complex in the $\Delta \Delta G$ thermodynamic cycle, introducing a physical inductive bias and achieving both supervised and unsupervised state-of-the-art (SoTA) performance. Experimental results on SKEMPI v2 indicate that our method achieves Spearman coefficients of 0.3201 (unsupervised) and 0.5134 (supervised), significantly surpassing the previously reported SoTA values of 0.2632 and 0.4324, respectively. Futhermore, we demonstrate the capability of our method on binding energy prediction, protein-protein docking and antibody optimization tasks.
- Oceania > Australia > South Australia > Adelaide (0.04)
- North America > United States > Maryland > Baltimore (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Asia > China (0.04)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)
SPIN: SE(3)-Invariant Physics Informed Network for Binding Affinity Prediction
Choi, Seungyeon, Seo, Sangmin, Park, Sanghyun
Accurate prediction of protein-ligand binding affinity is crucial for rapid and efficient drug development. Recently, the importance of predicting binding affinity has led to increased attention on research that models the three-dimensional structure of protein-ligand complexes using graph neural networks to predict binding affinity. However, traditional methods often fail to accurately model the complex's spatial information or rely solely on geometric features, neglecting the principles of protein-ligand binding. This can lead to overfitting, resulting in models that perform poorly on independent datasets and ultimately reducing their usefulness in real drug development. To address this issue, we propose SPIN, a model designed to achieve superior generalization by incorporating various inductive biases applicable to this task, beyond merely training on empirical data from datasets. For prediction, we defined two types of inductive biases: a geometric perspective that maintains consistent binding affinity predictions regardless of the complexs rotations and translations, and a physicochemical perspective that necessitates minimal binding free energy along their reaction coordinate for effective protein-ligand binding. These prior knowledge inputs enable the SPIN to outperform comparative models in benchmark sets such as CASF-2016 and CSAR HiQ. Furthermore, we demonstrated the practicality of our model through virtual screening experiments and validated the reliability and potential of our proposed model based on experiments assessing its interpretability.
- North America > United States (0.28)
- Asia > South Korea > Seoul > Seoul (0.04)
An open-source molecular builder and free energy preparation workflow - Communications Chemistry
Automated free energy calculations for the prediction of binding free energies of congeneric series of ligands to a protein target are growing in popularity, but building reliable initial binding poses for the ligands is challenging. Here, we introduce the open-source FEgrow workflow for building user-defined congeneric series of ligands in protein binding pockets for input to free energy calculations. For a given ligand core and receptor structure, FEgrow enumerates and optimises the bioactive conformations of the grown functional group(s), making use of hybrid machine learning/molecular mechanics potential energy functions where possible. Low energy structures are optionally scored using the gnina convolutional neural network scoring function, and output for more rigorous protein–ligand binding free energy predictions. We illustrate use of the workflow by building and scoring binding poses for ten congeneric series of ligands bound to targets from a standard, high quality dataset of protein–ligand complexes. Furthermore, we build a set of 13 inhibitors of the SARS-CoV-2 main protease from the literature, and use free energy calculations to retrospectively compute their relative binding free energies. FEgrow is freely available at https://github.com/cole-group/FEgrow , along with a tutorial. Automated free energy calculations for the prediction of binding free energies of ligands to a protein target are gaining importance for drug discovery, but building reliable initial binding poses for the ligands is challenging. Here, the authors introduce an open-source workflow for building user-defined congeneric series of ligands in protein binding pockets for input to free energy calculations.
New Machine learning tool can accelerate drug discovery
Machine learning can quickly and precisely evaluate binding free energy used in drug discovery, according to a March 15 study published in The Journal of Physical Chemistry Letters. The new machine learning tool, known as DeepBAR, was discovered by Xinqiang Ding, PhD, and Bin Zhang, PhD, researchers from the Massachusetts Institute of Technology in Cambridge. Drugs are only effective if they stick to their target proteins in the body, which can slow down drug discovery. Existing techniques struggle to balance efficiency and accuracy, researchers said. DeepBAR can accelerate the process because it is much quicker than other methods currently available.
Quickly Calculating Drug–Target Binding Affinity With Machine Learning
Drugs can only work if they stick to their target proteins in the body. Assessing that stickiness is a key hurdle in the drug discovery and screening process. The new technique, dubbed DeepBAR, quickly calculates the binding affinities between drug candidates and their targets. The approach yields precise calculations in a fraction of the time compared to previous state-of-the-art methods. The researchers say DeepBAR could one day quicken the pace of drug discovery and protein engineering.